RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number; / Q J kSZ > 

Source: /Ajo 
-©ate Processed by*STIC: ^k/^/O^ ~ 

s 

* THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 
PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF Til IS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 703-308-4212; FAX: 703-308-4221 
Effective 12/13/03 : TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 

h(tp:/Avww.uspto.govAvcb/ofTiccs/pac/chcckcr/chkr41no(c.h(m 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware thai there is 
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for URftisk/CD-Rom or replacement disk/CD-Rom. ^ * -\ 
Any reply including a sequence listing in electronic fornrshoold NOTTe sent to the 2023 1 zip code address for the 
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

I. EFS-Bio (<ht(p:/Av\vw.uspto.gov/ebc/cfs/downloads/documcnts.h(m> , EFS Submission 

User Manual -cPAVE) 
2 U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry directly to (EFFECTIVE 12/01/03): 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two. 
201 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, oc other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Roonv4B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Pftcc, Arlington, VA 22202 

Revised 10/03/03 




Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

) Wrapped Nuclcics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 

Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 



Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

Misaligned Amino The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead. 

_Non-ASCIJ The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 

ensure your subsequent submission is saved in ASCII text. 

Variable Length Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

Patentln 2.0 A "bug" in Patentln version 2.0 has caused. the <220>-<223> section to be missing from amino acid 

"bug" sequcnces(s) Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



Skipped Sequences 
""(OLD RULES) 



Sequencc(s) 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

missing. If intentional, please insert the following lines for each skipped sequence. 



Skipped Sequences Sequence(s) 

(NEW RULES) <2 1 0> sequence id number 

<400> sequence id number 

000 



10 



11 



12 



Use of n's or Xaa's 
(NEW RULES) 



Jnvalid <213> 
Response 



Useof<220> 



_Patentln 2.0 
"bug" 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per L823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 




Sequence(s) 



missing the <770> "Feature" and associa ted numeric identifiers and responses 

se of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
Unknown. " Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 



Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 



Misuse of n/Xaa "n" can only represent a single nucleotide ; "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 
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IFWO 



RAW SEQUENCE LISTING 

PATENT APPLICATION : US/10/653,680 



DATE: 03/29/2004 
TIME: 13:27:28 



Input Set : A:\U 014797-5.ST25.txt 
Output Set: N:\CRF4\03292004\J653680.raw 



<110> APPLICANT: DAI, KEN-SHWO 

<120> TITLE OF INVENTION: HUMAN SACH-RELATED VARIANTS ASSOCIATED WITH CANCERS 
<130> FILE REFERENCE: U 014797-5 
<140> CURRENT APPLICATION NUMBER: 10/653,680 
<141> CURRENT FILING DATE: 2003-09-02 
<160> NUMBER OF SEQ ID NOS : 6 
<170> SOFTWARE: Patentln version 3.2 
<210> SEQ ID NO: 1 
<2 1 1 > LENGTH: 1960 
<212> TYPE: DNA 
<213> ORGANISM /"ART I FI CI 1 
<220> 



58 
60 
61 
62 
64 
65 
66 




3 
5 
7 
9 
10 
12 
14 
16 
17 
18 
19 
21 
22 
25 
26 
27 
29 
30 
32 
34 
36 
38 
40 
41 
42 

44 cgc egg etc aac ttc age ggc ttt 

45 Arg Arg Leu Asn Phe Ser Gly Phe Gly Tyr Ser Leu Pro Gin Gin Gin 

46 5 10 15 20 
48 ccg gee gec gtg gcg cgc cgc aac gag cgc gag cgc aac cgc gtc aag 
4 9 Pro Ala Ala Val Ala Arg Arg Asn Glu Arg Glu Arg Asn Arg Val Lys 
50 25 30 35 

52 ttg gtc aac ctg ggc ttt gec acc ctt egg gag cac gtc ccc aac ggc 

53 Leu Val Asn Leu Gly Phe Ala Thr Leu Arg Glu His Val Pro Asn Gly 

54 40 45 50 

56 gcg gec aac aag aag atg agt aag gtg gag aca ctg cgc teg gcg gtc 

57 Ala Ala Asn Lys Lys Met Ser Lys Val Glu Thr Leu Arg Ser Ala Val 
55 60 65 
ate cgc gcg ctg cag cag ctg ctg gac gag cat gac gcg gtg 
He Arg Ala Leu Gin Gin Leu Leu Asp Glu His Asp Ala Val 

75 80 
gec ttc cag gca ggc gtc ctg teg ccc acc ate tec ccc aac 
Ser Ala Ala Phe Gin Ala Gly Val Leu Ser Pro Thr He Ser Pro Asn 
85 90 95 100 



FEATURE : 
<223> OTHER INFORMATION 
<220> FEATURE: 
<221> NAME/KEY : CDS 
<222> LOCATION: (345) . . (761) 
<4 00> SEQUENCE: 1 
ggcacgaggc ttctggccag ggaacgtgga 
ggcgagtgaa agaaggaaat cagaaaggaa 
agccacggct ggagagaccg agacccggcg 
cgcgagacgc ggcagcgcag agcgcgcagc 
aggcgccgca gctgagaccg gcggccgacg 
cgcccaagca agtcaagega cagcgctcgt 



yff- 1 

Does Hot Corn 
Corrected Diskette Needed 

(fY\ OV^J 

acagggatcc ggecagggag ; i! 17 

aaaataataa aaacagcctg 
cagecttagt aggagaggaa 
gcagcagcag cagcagcagc 
agggggeggt cacaagtcag 
cttcgcccga actg atg cgc tgc aaa 
Met Arg Cys Lys 
1 

ggc tac age ctg ccg cag cag cag 



aggcgcaccg 
gggagttaac 
caagagagcg 
agcagcagca 
gccagccctc 



60 
120 
180 
240 
300 
356 



gag 
Glu 



tac 
Tyr 
70 

age gec 



404 



452 



500 



548 



596 



644 



file://C:\CRF4\Outhold\VsrJ653680.htm 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/653 , 680 



DATE: 03/29/2004 
TIME: 13:27:28 



Input Set : A:\U 014797-5.ST25.txt 
Output Set: N:\CRF4\03292004\J653680.raw 



68 tac tec aac gac ttg aac tec atg gee ggc teg ccg gtc tea tec tac 

69 Tyr Ser Asn Asp Leu Asn Ser Met Ala Gly Ser Pro Val Ser Ser Tyr 

70 105 110 115 

72 teg teg gac gag ggc tct tac gac ccg etc age ccc gag gag cag gag 

73 Ser Ser Asp Glu Gly Ser Tyr Asp Pro Leu Ser Pro Glu Glu Gin Glu 

74 120 125 130 

76 ctt etc gac ttc acc aac tgg ttctgagggg ctcggcctgg tcaggccctg 

77 Leu Leu Asp Phe Thr Asn Trp 

78 135 

80 gtgcgaatgg actttggaag cagggtgatc gcacaacctg catctttagt gctttcttgt 
82 cagtggcgtt gggaggggga gaaaaggaaa agaaaaaaaa agaagaagaa gaagaaaaga 
84 gaagaagaaa aaaacgaaaa cagtcaacca accccatcgc caactaagcg aggcatgect 
86 gagagacatg gctttcagaa aaegggaage gctcagaaca gtatctttgc actccaatca 
88 ttcaeggaga tatgaagagc aactgggacc tgagtcaatg cgcaaaatgc agcttgtgtg 
90 caaaagcagt gggctcctgg cagaagggag cagcacacgc gttatagtaa ctcccatcac 
92 ctctaacacg cacagctgaa agttcttgct cgggtccctt cacctccccg ccctttctta 
94 gagtgcagtt cttagccctc tagaaacgag ttggtgtctt tegtctcagt agcccccacc 
96 ccaataagct gtagacattg gtttaeagtg aaactatget attctcagcc ctttgaaact 
98 ctgcttctcc tccagggccc gattcccaaa ccccatggct tccctcacac tgtcttttct 
100 accattttca ttatagaatg cttccaatct tttgtgaatt ttttattata aaaaatctat 
102 ttgtatctat cctaaccagt teggggatat attaagatat ttttgtacat aagagagaaa 
104 gagagagaaa aatttataga agttttgtac aaatggttta aaatgtgtat atcttgatac 
106 tttaacatgt aatgetatta cctctgcata ttttagatgt gtagttcacc ttacaactgc 
108 aattttccct atgtggtttt gtaaagaact ctcctcatag gtgagatcaa gaggccacca 
110 gttgtacttc agcaccaatg tgtcttactt tatagaaatg ttgttaatgt attaatgatg 
112 ttattaaata ctgttcaaga agaacaaagt ttatgeaget actgtccaaa ctcaaagtgg 
114 cagccagttg gttttgatag gttgcctttt ggagatttct attactgect tttttttctt 
116 actgttttat tacaaactta caaaaatatg tataaccctg ttttatacaa actagtttcg 
118 taataaaact ttttcctttt tttaaaatg 

121 <210> SEQ ID NO: 2 

122 <211> LENGTH: 139 

123 <212> TYPE: PRT 

124 <213> ORGANISM 

126 <220> FEATURE: 

127 <223> OTHER IN FORMAT IO; 




129 


<400> SEQUENCE: 


2 
























131 


Met 


Arg 


Cys 


Lys 


Arg 


Arg 


Leu 


Asn 


Phe 


Ser 


Gly 


Phe 


Gly 


Tyr 


Ser 


Leu 


132 


1 








5 










10 










15 




135 


Pro 


Gin 


Gin 


Gin 


Pro 


Ala 


Ala 


Val 


Ala 


Arg 


Arg 


Asn 


Glu 


Arg 


Glu 


Arg 


136 








20 










25 










30 






139 


Asn 


Arg 


Val 


Lys 


Leu 


Val 


Asn 


Leu 


Gly 


Phe 


Ala 


Thr 


Leu 


Arg 


Glu 


His 


140 






35 










40 










45 








143 


Val 


Pro 


Asn 


Gly 


Ala 


Ala 


Asn 


Lys 


Lys 


Met 


Ser 


Lys 


Val 


Glu 


Thr 


Leu 


144 




50 










55 










60 










147 


Arg 


Ser 


Ala 


Val 


Glu 


Tyr 


He 


Arg 


Ala 


Leu 


Gin 


Gin 


Leu 


Leu 


Asp 


Glu 


148 


65 










70 










75 










80 


151 


His 


Asp 


Ala 


Val 


Ser 


Ala 


Ala 


Phe 


Gin 


Ala 


Gly 


Val 


Leu 


Ser 


Pro 


Thr 


152 










85 










90 










95 




155 


He 


Ser 


Pro 


Asn 


Tyr 


Ser 


Asn 


Asp 


Leu 


Asn 


Ser 


Met 


Ala 


Gly 


Ser 


Pro 



692 



740 



791 



851 
911 
971 
1031 
1091 
1151 
1211 
1271 
1331 
1391 
1451 
1511 
1571 
1631 
1691 
1751 
1811 
1871 
1931 
1960 
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RAW SEQUENCE LISTING DATE: 03/29/2004 

PATENT APPLICATION: US/10/653 , 680 TIME: 13:27:28 

Input Set : A:\U 014797-5.ST25.txt 
Output Set: N:\CRF4\03292004\J653680.raw 

156 100 105 110 

159 Val Ser Ser Tyr Ser Ser Asp Glu Gly Ser Tyr Asp Pro Leu Ser Pro 

160 115 120 125 

163 Glu Glu Gin Glu Leu Leu Asp Phe Thr Asn Trp 

164 130 135 

167 <210> SEQ ID NO: 3 

168 <211> LENGTH: 1994 

169 <212> TYPE: DNA 

170 <213> ORGANISM^ ART 

172 <220> FEATURE: 

173 <223> OTHER INFCl 1 TIOIn 
176 <220> FEATURE: 




111 <221> NAME/KEY 
178 <222> LOCATION 
180 <400> SEQUENCE 



CDS 

(550) . . (795) 
3 

181 ggcacgaggc ttctggccag ggaacgtgga aggcgcaccg acagggatcc ggccagggag 60 

183 ggcgagtgaa agaaggaaat cagaaaggaa gggagttaac aaaataataa aaacagcctg 120 

185 agecacggct ggagagacog agacccggcg caagagagcg cagccttagt aggagaggaa 180 

187 cgcgagacgc ggcagagcgc gttcagcact gacttttgct gctgcttctg cttttttttt 240 

189 tcttagaaac aagaaggcgc cagcggcagc ctcacacgcg agcgccacgc gaggctcccg 300 

191 aagccaaccc gcgaagggag gaggggaggg aggaggaggc ggcgtgcagg gaggagaaaa 360 

193 agcattttca ctttttttgc tcccactcta agaagtctcc cggggatttt gtatatattt 420 

195 tttaacttcc gtcagggctc ccgcttcata tttccttttc tttccctctc tgttcctgca 480 

197 cccaagttgg tcaacctggg ctttgccacc cttcgggagc acgtccccaa cggcgcggcc 540 

199 aacaagaag atg agt aag gtg gag aca ctg cgc teg gcg gtc gag tac ate 591 

200 Met Ser Lys Val Glu Thr Leu Arg Ser Ala Val Glu Tyr He 

201 15 10 

203 cgc gcg ctg cag cag ctg ctg gac gag cat gac gcg gtg age gec gec 639 

204 Arg Ala Leu Gin Gin Leu Leu Asp Glu His Asp Ala Val Ser Ala Ala 

205 15 20 25 30 

207 ttc cag gca ggc gtc ctg teg ccc acc ate tec ccc aac tac tec aac 687 

208 Phe Gin Ala Gly Val Leu Ser Pro Thr He Ser Pro Asn Tyr Ser Asn 

209 35 40 45 

211 gac ttg aac tec atg gec ggc teg ccg gtc tea tec tac teg teg gac 735 

212 Asp Leu Asn Ser Met Ala Gly Ser Pro Val Ser Ser Tyr Ser Ser Asp 

213 50 55 60 

215 gag ggc tct tac gac ccg etc age ccc gag gag cag gag ctt etc gac 783 

216 Glu Gly Ser Tyr Asp Pro Leu Ser Pro Glu Glu Gin Glu Leu Leu Asp 

217 65 70 75 

219 ttc acc aac tgg ttctgagggg ctcggcctgg tcaggccctg gtgcgaatgg 835 

220 Phe Thr Asn Trp 

221 80 

223 actttggaag cagggtgatc gcacaacctg catctttagt gctttcttgt cagtggcgtt 895 

225 gggaggggga gaaaaggaaa agaaaaaaaa agaagaagaa gaagaaaaga gaagaagaaa 955 

227 aaaacgaaaa cagtcaacca accccatcgc caactaagcg aggcatgect gagagacatg 1015 

229 gctttcagaa aaegggaage gctcagaaca gtatctttgc actccaatca ttcaeggaga 1075 

231 tatgaagagc aactgggacc tgagtcaatg cgcaaaatgc agcttgtgtg caaaagcagt 1135 

233 gggctcctgg cagaagggag cagcacacgc gttatagtaa ctcccatcac ctctaacacg 1195 

235 cacagctgaa agttcttgct cgggtccctt cacctccccg ccctttctta gagtgcagtt 1255 
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237 
239 
241 
243 
245 
247 
249 
251 
253 
255 
257 
259 
261 
264 
265 
266 
267 
269 
270 
272 
274 
275 
278 
279 
282 
283 
286 
287 
290 
291 
294 
298 
299 
300 
301 
303 
304 
307 
308 
309 
311 
312 
314 
316 
318 
320 
322 
324 
326 



RAW SEQUENCE LISTING DATE: 03/29/2004 

PATENT APPLICATION: US/10/653,680 TIME: 13:27:28 

Input Set : A:\U 014797-5.ST25.txt 
Output Set: N:\CRF4\03292004\J653680.raw 

cttagccctc tagaaacgag ttggtgtctt tcgtctcagt agcccccacc ccaataagct 
gtagacattg gtttacagtg aaactatgct attctcagcc ctttgaaact ctgcttctcc 
tccagggccc gattcccaaa ccccatggct tccctcacac tgtcttttct accattttca 
ttatagaatg cttccaatct tttgtgaatt ttttattata aaaaatctat ttgtatctat 
cctaaccagt tcggggatat attaagatat ttttgtacat aagagagaaa gagagagaaa 
aatttataga agttttgtac aaatggttta aaatgtgtat atcttgatac tttaacatgt 
aatgctatta cctctgcata ttttagatgt gtagttcacc ttacaactgc aattttccct 
atgtggtttt gtaaagaact ctcctcatag gtgagatcaa gaggccacca gttgtacttc 
agcaccaatg tgtcttactt tatagaaatg ttgttaatgt attaatgatg ttattaaata 
ctgttcaaga agaacaaagt ttatgcagct actgtccaaa ctcaaagtgg cagccagttg 
gttttgatag gttgcctttt ggagatttct attactgcct tttttttctt actgttttat 
tacaaactta caaaaatatg tataaccctg ttttatacaa actagtttcg taataaaact 
ttttcctttt tttaaaatg 
<210> SEQ ID NO: 4 
LENGTH: 82 
TYPE: PRT 

ORGAN I SM (f \F(TIFICIAL^ 
FEATURE: -7-^ 
OTHER INFORMATIOlj^Tjl^N* WN 



<211> 
<212> 
<213> 
<220> 
<223> 

<400> SEQUENCE: 4 
Met Ser Lys Val Glu Thr Leu Arg Ser Ala Val Glu Tyr lie Arg Ala 
15 10 15 

Leu Gin Gin Leu Leu Asp Glu His Asp Ala Val Ser Ala Ala Phe Gin 

20 25 30 

Ala Gly. Val Leu Ser Pro Thr lie Ser Pro Asn Tyr Ser Asn Asp Leu 

35 40 45 

Asn Ser Met Ala Gly Ser Pro Val Ser Ser Tyr Ser Ser Asp Glu Gly 

50 55 60 

Ser Tyr Asp Pro Leu Ser Pro Glu Glu Gin Glu Leu Leu Asp Phe Thr 
65 70 75 80 

Asn Trp 

<210> SEQ ID NO: 5 
<211> LENGTH: 2304 
TYPE: DNA 

ORGANISM/^ AF r 1 FI< EAL 
FEATURE : 

OTHER INFO IA r J N< INKN i 1 
FEATURE: 
NAME /KEY : CDS 
LOCATION: (569) . . (805) 




<212> 
<213> 
<220> 
<223> 
<220> 
<221> 
<222> 
<4 00> SEQUENCE: 5 

ggcacgaggc ttctggccag ggaacgtgga aggcgcaccg acagggatcc ggccagggag 
ggcgagtgaa agaaggaaat cagaaaggaa gggagttaac aaaataataa aaacagcctg 
agccacggct ggagagaccg agacccggcg caagagagcg cagccttagt aggagaggaa 
cgcgagacgc ggcagagcgc gttcagcact gacttttgct gctgcttctg cttttttttt 
tcttagaaac aagaaggcgc cagcggcagc ctcacacgcg agcgccacgc gaggctcccg 
aagccaaccc gcgaagggag gaggggaggg aggaggaggc ggcgtgcagg gaggagaaaa 
agcattttca ctttttttgc tcccactcta agaagtctcc cggggatttt gtatatattt 
tttaacttcc gtcagggctc ccgcttcata tttccttttc tttccctctc tgttcctgca 



1315 
1375 
1435 
1495 
1555 
1615 
1675 
1735 
1795 
1855 
1915 
1975 
1994 



60 
120 
180 
240 
300 
360 
420 
480 
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RAW SEQUENCE LISTING DATE: 03/29/2004 

PATENT APPLICATION: US/10/653,680 TIME: 13:27:28 

Input Set : A:\U 014797-5.ST25.txt 
Output Set: N:\CRF4\03292004\J653680.raw 

328 cccaagttct ctctgtgtcc ccctcgcggg ccccgcacct cgcgtcccgg atcgctctga 540 

330 ttccgcgact ccttggccgc cgctgcgc atg gaa age tct gec aag atg gag 592 

331 Met Glu Ser Ser Ala Lys Met Glu 

332 *' 1 5 

334 age ggc ggc gec ggc cag cag ccc cag ccg cag ccc cag cag ccc ttc 640 

335 Ser Gly Gly Ala Gly Gin Gin Pro Gin Pro Gin Pro Gin Gin Pro Phe 

336 10 15 20 

338 ctg ccg ccc gca gec tgt ttc ttt gec acg gec gca gec gcg gcg gec 688 

339 Leu Pro Pro Ala Ala Cys Phe Phe Ala Thr Ala Ala Ala Ala Ala Ala 

340 25 30 35 40 

342 gca gec gec gca gcg gca gcg cag age gcg cag cag cag cag cag cag 736 

343 Ala Ala Ala Ala Ala Ala Ala Gin Ser Ala Gin Gin Gin Gin Gin Gin 

344 45 50 55 

34 6 cag cag cag. cag cag cag gcg ccg cag ctg aga ccg gcg gec ga.c ggc 784 

347 Gin Gin Gin Gin Gin Gin Ala Pro Gin Leu Arg Pro Ala Ala Asp Gly 

348 60 65 70 

350 cag ccc tea ggg ggc ggt cac aagtcagege ccaagcaagt caagegacag 835 

351 Gin Pro Ser Gly Gly Gly His 

352 • 75 

354 cgctcgtctt cgcccgaact gatgegctge aaacgccggc tcaacttcag eggctttgge 895 
356 tacagcctgc cgcagcagca getgetggae gagcatgacg eggtgagege cgccttccag 955 
358 gcaggcgtcc tgtcgcccac catctccccc aactactcca acgacttgaa ctccatggcc 1015 
360 ggctcgccgg tctcatccta ctcgtcggac gagggctctt acgacccgct cagccccgag 1075 
362 gagcaggagc ttctcgactt caccaactgg ttctgagggg ctcggcctgg tcaggccctg 1135 
364 gtgcgaatgg actttggaag cagggtgatc gcacaacctg catctttagt gctttcttgt 1195 
366 cagtggcgtt gggaggggga gaaaaggaaa agaaaaaaaa agaagaagaa gaagaaaaga 1255 
368 gaagaagaaa aaaacgaaaa cagtcaacca accccatcgc caactaagcg aggcatgect 1315 
370 gagagacatg gctttcagaa aaegggaage gctcagaaca gtatctttgc actccaatca 1375 
372 ttcaeggaga tatgaagagc aactgggacc tgagtcaatg cgcaaaatgc agcttgtgtg 1435 
374 caaaagcagt gggctcctgg cagaagggag cagcacacgc gttatagtaa ctcccatcac 14 95 
376 ctctaacacg cacagctgaa agttcttgct cgggtccctt cacctccccg ccctttctta 1555 
378 gagtgcagtt cttagccctc tagaaacgag ttggtgtctt tegtctcagt agcccccacc 1615 
380 ccaataagct gtagacattg gtttacagtg aaactatget attctcagcc ctttgaaact 1675 
382 ctgcttctcc tccagggccc gattcccaaa ccccatggct tccctcacac tgtcttttct 1735 
384 accattttca ttatagaatg cttccaatct tttgtgaatt ttttattata aaaaatctat 1795 
386 ttgtatctat cctaaccagt teggggatat attaagatat ttttgtacat aagagagaaa 1855 
388 gagagagaaa aatttataga agttttgtac aaatggttta aaatgtgtat atcttgatac 1915 
390 tttaacatgt aatgetatta cctctgcata ttttagatgt gtagttcacc ttacaactgc 1975 
392 aattttccct atgtggtttt gtaaagaact ctcctcatag gtgagatcaa gaggccacca 2035 
394 gttgtacttc agcaccaatg tgtcttactt tatagaaatg ttgttaatgt attaatgatg 2095 
396 ttattaaata ctgttcaaga agaacaaagt ttatgeaget actgtccaaa ctcaaagtgg 2155 
398 cagccagttg gttttgatag gttgcctttt ggagatttct attactgect tttttttctt 2215 
400 actgttttat tacaaactta caaaaatatg tataaccctg ttttatacaa actagtttcg 2275 
402 taataaaact ttttcctttt tttaaaatg 2304 

405 <210> SEQ ID NO: 6 

406 <211> LENGTH: 79 

407 <212> TYPE: PRT j* ; 

408 <213> ORGANISM^ARTIFICIAI^^^t^^^^ 
410 <220> FEATURE: 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 03/29/2004 

PATENT APPLICATION: US/10/653 , 680 TIME: 13:27:29 

Input Set : A:\U 014797-5.ST25.txt 
Output Set: N:\CRF4\03292004\J653680.raw 

nval id <21 3 > Re s ponse: 

se of "Artificial" only as "<213> Organism" response is incomplete, 

er 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 

eq#:l,2, 3,4,5,6 
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VERIFICATION SUMMARY DATE: 03/29/2004 

PATENT APPLICATION: US/10/653 , 680 TIME: 13:27:29 

Input Set : A:\U 014797-5.ST25.txt 
Output Set: N:\CRF4\03292004\J653680.raw 
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